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DEFECTS TN DRUG METABOLISM 
FTELD OF THE IN VENTION 

The invention relates to genetic material, 
specifically primers, for use in a method designed to 
5 determine the genotype of an individual; and also a kit, 
including the genetic material of the invention, for 
performing the method of the invention. 

BACKGROUND OF THE TNVENTION 

10 jt is well known that genetic polymorphisms in 

drug metabolizing genes give rise to a variety of 
phenotypes. This information has been used to advantage 
in the past for developing genetic assays that predict 
phenotype and thus predict an individual's ability to 

15 metabolize a given drug. The information is of particular 
value in determining the likely side effects and 
therapeutic failures of various drugs. The availability 
of this sort of information will result in routine 
phenotyping being recommended for certain categories of 

20 patients. 

Drug metabolism is carried out by the cytochrome 
P4 50 family of enzymes. For example, the cytochrome P4 50 
isozyme gene, CYP2C9 encodes a high affinity hepatic 
[S]-warfarin 7-hydroxylase which appears to be principally 

25 responsible for the metabolic clearance of the most potent 
enantiomer of warfarin. Similarly, the cytochrome P4 50 
isozyme gene, CYP2A6, encodes a protein that metabolizes 
nicotine and coumarin and activates the tobacco-specific 
nitrosamine 4- (methyinitrosamino) -1- (3-pyridyl) - 

30 l-butanone) (NNK) . 

It is of note that the above gene products are 
also known to metabolize other substrates, for example, 
the CYP2C9 gene product is also known to metabolize 
Tolbutamide, Phenytoin, Ibuprofen, Naproxen, Tienilic 

35 acid, Diclofenac and Tetrahydrocannabinol. 

It follows that genetic polymorphisms or 
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mutations in either of the two aforementioned genes can 
lead to an impairment in metabolism of at least the 
aforementioned drugs. 

In so far as CYP2C9 is concerned, sequences 
reported by Yasumori et al (1987 J. Biochem. 102:1075- 
5 1082.) and Kimura et al (1987 Nuc. Acids Res. 15:10053- 

10054) show differences at several positions including a C 
to T base change that results in a Arginine/ Cysteine 
polymorphism at amino acid 144. This polymorphism has 
been designated R144C. 
10 In so far as CYP2A6 is concerned, a T to A base 

change at position 488 of the cDNA sequence described by 
Yamano et al (1990 Biochemistry 29:1322-1329) results in 
substitution of Leucine 160 by Histidine. Henceforth this 
mutant form of the gene will be designated CYP2A6vl. 
15 The variant CYP2A6vl encodes an enzyme that is 

unstable and catalytically inactive. It is found in the 
general population at a frequency of about 1% but does not 
account for all slow metabolizers of coumarin. 

Since the cDNA sequence structure of CYP2C9 and 
20 CYP2A6 are known, and since it is also known to perform 

genetic assays to determine whether a preselected mutation 
is present within a given gene, it should, in theory, be 
possible to design assays which specifically determine 
whether either of the aforementioned mutations are present 
25 in each of the respective aforementioned genes. 

However, we have found an extraordinarily high 
degree of exon homology in the cytochrome P450 genes. 
This has resulted in non-specific binding of assay 
materials and poor performance of assays. In the instance 
30 where primers have been used to hybridize to genetic 

material, non-specific binding of such primers has taken 
place, and in the further instance where primers have been 
used to hybridize to genetic material with a view to 
performing a polymerase chain reactions we have found that 
35 related genes have also been amplified, for example, 
CYP2A7, CYP2A12 and CYP2C8 have also been amplified. 



WO 95/34679 PCT/US95/07605 



- 3 - 

SUMMARY OF THE INVENTION 
The present invention relates to novel variant 
alleles in cytochrome P4 50 genes which express enzymes 
involved in the metabolism of particular drugs and/or 
chemical carcinogens. 
5 One object of the present invention relates to 

the discovery of new mutant or variant CYP2A6 alleles 
wherein the human gene is characterized. A new variant 
allele has been found which is designated CYP2A6v2. The 
cDNA and genomic sequence of CYP2A6v2 is provided in the 
10 present invention. Another new gene related to 

CYP2A6 has been discovered and is designated CYP2A13 . The 
cDNA and. genomic sequence of CYP2A13 is provided in the 
present invention. 

Another object of the present invention relates 
15 to the use. of intron sequences to specifically identify* 
CYP2A6 and CYP2C9 variants in a gene specific detection- 
assay. 

Another object of the present invention is to; 
use an oligonucleotide probe, specific for regions unique 
20 to a particular CYP2 variant to screen for the presence or 
absence of the variant in a sample. 

Yet another object of the invention is to 
provide genetic material, a method, and a kit which enable 
genotyping of the CYP2C9 and CYP2A6 gene with a view to 
25 providing phenotypic information concerning drug 
metabolism. 

A further object of the present invention 
provides a method for diagnostically determining the 
sensitivity of a patient for specific drugs and chemical 
30 carcinogens. Such a method is widely applicable in 

determining the proper dosage of a drug for a patient. 

Another object of the present invention provides 
a method of genotyping CYP2A6 and CYP2C9 and determining 
whether a mutation has altered the sequence of these genes 
35 and hence altered sensitivity to particular drugs and 

chemical carcinogens. 
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In accordance with the present invention a 
method is provided which utilizes the finding that each 
variant of a CYP2 gene has specific nucleotide differences 
as compared with the wild-type CYP2 gene. Such nucleotide 
changes can be utilized in a probe-hybridization assay, 
which is capable of specifically detecting a chosen 
variant and not other variants* 

The present invention also provides a genotyping 
method for identifying the presence or absence of a 
mutation at codon 144 of the coding sequence of CYP2C9 , or 
alternatively, at codon 160 of the coding sequence of 
CYP2A6 , or alternatively, a gene conversion event 
involving CYP2A6 and CYP2A7 in exons 3, 6 or 8 comprising 
use of a portion of DNA. Such a mutation is then 
correlated to the sensitivity of particular drugs and 
chemical carcinogens . 

The present invention further relates to a gene- 
specific bioassay which is capable of distinguishing 
between the CYP2 genes and identify the presence or 
absence of a mutation in CYP2A6 and CYP2C9 genes. Such a 
bioassay can diagnostically predict the sensitivity of an 
individual to particular drugs or chemical carcinogens. 
For example, the CYP2C9 variants identify a sensitivity to 
a commonly used anti-coagulant drug, warfarin. The CYP2A6 
variants identify sensitivity to coumarin, nicotine and 
nitrosamines . The sensitivity to nicotine may be used to 
predict a predisposition to tobacco-related diseases, a 
propensity to smoking and adverse reactions to exposure to 
nicotine. Further, CYP2A6 genes are associated with the 
activation of nitrosamines, elevated levels of which have 
been correlated with many cancers. 

The present invention also provides a method of 
genotyping the CYP2A6 and CYP2C9 genes using allele- 
specific amplification reaction. 

In addition, a highly-specific combination 
genotyping bioassay has been developed to identify 
mutations within CYP2A6 and CYP2C9 which are linked to 
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sensitivity to particular drugs and chemical carcinogens. 
This combination bioassay comprises a gene-specif ic 
amplification reaction, an exon-specif ic amplification 
reaction and an endonuclease cleavage reaction wherein 
only one form, either mutant or wild-type is cleaved, 
5 producing either a single nucleic acid fragment or 
multiply nucleic acid fragments depending upon the 
presence or absence of the mutation. For example, one 
CYP2C9 variant, R144C, which contains a C 472 -T mutation can 
be identified by an Avail restriction site. CYP2A6 
10 variants can also be identified by their corresponding 

mutations. CYP2A6vl which contains a T 48S -A mutation can 
be identified by a Xcml restriction site. CYP2A6V2 which 
contains a T 415 -A mutation can be identified by a Ddel 
restriction site. 
15 The present invention also relates to a method 

for screening patients for drug sensitivity prior to their 
treatment with that drug, thereby alerting a physician of 
a drug sensitivity. In addition, the method may be used! to 
screen patients for a predisposition to cancers related. to 
20 excessive nitrosamine activation, which are associated * 

with mutations within the CYP2A6 gene locus. Further, the 
method may be used to screen patients for a sensitivity to 
chemical carcinogens, based upon the genotype of the 
CYP2A6 and/or CYP2C9 alleles. 
25 one such new allele variant, CYP2A6v2, has 98% 

nucleotide similarity and 80% amino acid similarity with 
the wild type CYP2A6, respectively. The present invention 
relates to the new CYP2A6v2 variant, the cDNA sequence and 
its genomic sequence wherein the alterations in sequence 
30 are within exons 3, 6 and 8, which are attributed to a 
gene conversion. In addition, another new gene, also 
involved in drug metabolism has been identified, and has 
been designated CYP2A13. This gene plays a similar role 
in drug metabolism as CYP2A6 . These new gene sequences or 
35 fragments thereof are used as probes in identifying 
specific CYP2 variants in samples. In additions, 
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fragments of the new genes are used as primers in a 
genotyping assay. 

The invention further provides isolated CYP2AV2 
and CYP2A13 cDNAs for use in gene therapy and replacement 
protocols for individuals who are predisposed to 
^ sensitivity to needed drugs or to chemical or 
environmental carcinogens . 

In accordance with an aspect of the present 
invention, there are provided primary human cells which 
are genetically engineered with CYP2A6v2 or CYP2A13 DNA 
10 (RNA) which encodes a therapeutic agent of interest, and 
the genetically engineered cells are employed as a 
therapeutic agent. (The term "therapeutic , " as used 
herein, includes treatment and/or prophylaxis.) 

Gene expression in an organism in accordance 
^ with the practices of this invention is regulated, 

inhibited and/or controlled by incorporating in or along 
with the genetic material of the organism non-native DNA 
which transcribes to produce an RNA which is complementary 
to and capable of binding or hybridizing to a mRNA 
20 produced by a gene located within said organism. Upon 

binding to or hybridization with the mRNA, the translation 
of the mRNA is prevented. Consequently, the protein coded 
for by the mRNA is not produced. In the instance where 
the mRNA translated product, e.g. protein, is vital to the 
25 growth of the organism or cellular material, the organism 
is so transformed or altered such that it becomes, at 
least, disabled. 

Accordingly, in the practices of this invention 
from a genetic point of view as evidenced by gene 
30 expression, new organisms are readily produced. Further, 
the practices of this invention provide a powerful tool or 
technique for altering gene expression or organisms 
through gene therapy. The practices of this invention may 
cause the organisms to be disabled or incapable of 
35 functioning normally or may impart special properties 

thereto. The DNA of CYP2A6v2 or CYP2A13 employed in the 
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practices of this invention can be incorporated into the 
treated or effected organisms by direct introduction into 
the nucleus of a eukaryotic organism or by way of a 
plasmid or suitable vector containing the special DNA of 
this invention in the case of a procaryotic organism. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 
Embodiments of the invention are described by 
way of example only with reference to the accompanying 
^ figures wherein: 

Fig. 1- Shows the sequence of exon 2, intron 2 
and exon 3 of CYP2C8 and CYP2C9 , cDNA sequences (from 4) 
are shown at the top of the page together with sequences 
from 6 genomic clones encompassing exon 2, intron 2 and 
15 exon 3 of CYP2C8 and CYP2C9. The position of the 

polymorphism at codon 144 of CYP2C9 and the PCR primers 
are indicated. 

Fig. 2. Shows the sequence of intron 2, exon 3 
and intron 3 of CYP2A6 , CYP2A7 and CYP2A12 . The position 
20 of the polymorphism at codon 160 in CYP2A6 and the PCRv 
primers are indicated. 

Fig. 3. Shows the detection of CYP2C9 Arg U4 Cys 
polymorphism by PCR. Following amplification, samples 
were digested with Avail and analyzed on a 1.8 % agarose 
25 gel . Lane I and lanes 3 to 6 show homozygous wild-type 
subjects, lane 2 a heterozygous individual and lane 7 
undigested PCR product. 

Fig. 4. Shows detection of CYP2A6 Leu 160 . His 
polymorphism by PCR. Two parallel PCR reactions were 
30 carried out and the products analyzed on a 1 % agarose 

gel. Lanes 1, 3, 5 and 7 show the results of the wild-type 
specific assay and lanes 2, 4, 6 and 8 the results of the 
variant-specific assay for the same four subjects. 
Subjects I and 2 (lanes 1-4) are homozygous wild-type, 
35 subject 3 (lanes 5 and 6) heterozygous and subject 4 

(lanes 7 and 8) homozygous for the mutation. 
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Fig. 5. Shows distribution of the weekly 
maintenance doses for warfarin in patients (n=57) 
homozygous for the CYP2C9 wild-type allele (open bars) and 
heterozygous (n=37) for the R144C mutant allele (solid 
bars) . Arrows show the median weekly dose requirement of 
warfarin for each genotype. 

Fig. 6. Represents 7-hydroxylation of coumarin 
(%) in a family genotyped for the CYP2A6 and CYP2A6vl 
alleles, showing a subject homozygous for the CYP2A6vl 
allele who is deficient in coumarin 7-hydroxylation. 

Fig. 7. Shows the difference between the genomic 
and cDNA sequences for the CYP2A6 gene. 

Figs. 8a and b. Shows the conversion event which 
leads to the CYP2A6v2 allele. 

Figs. 9a through 9c. Shows the detection of 
15 CYP2A6V2 by PCR. (Fig. 9A) gene-specific amplification by 
PCR of the CYP2A6 gene using E3F and E3R. Lanes 1 to 4 
show the 7.8 Kb band obtained from several representative 
human genomic DNA templates, lane 5 correspond to a 
negative control in the absence of template and lane 6 
20 contains 1 Kb DNA ladder (GIBCO BRL) as six markers. 

(Fig. 9B) Exon-specif ic PCR amplification of exon 3 from 
the 7.8 Kb long-PCR product and restriction endonuclease 
pattern obtained after digestion with Xcjnl (left) and Ddel 
(right) to detect the CYP2A6vl and CYP2A6v2 alleles, 
25 respectively. The genotypes shown correspond to: wild type 
(+/+) / heterozygous (+/-) and homozygous (-/-) subjects. 
(C) The genotyping strategy which has been developed. 
Exons are indicated by boxes. The position of the 
corresponding primer pairs are indicated by horizontal 
30 arrows. Xcml and Ddel restriction sites generate 

digestion patterns for the different alleles having 
fragment sizes as shown. 

Fig. 10. Schematic diagram depicting 
methodology underlying a CYP2C9 genotyping assay. 
35 Fig. 11. CYP2A6v2 cDNA sequence. 

Fig. 12. CYP2A6V2 genomic DNA sequence having 
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7216 base pairs. 

Fig. 13. CYP2A13 cDNA sequence. 

Fig. 14. CYP2A13 genomic DNA sequence having 

8779 base pairs. 

Fig. 15. Agarose minigel electrophoresis of PCR 
5 products. The CYP2C9 wild-type allele (Arg-144) and R144C 
respectively, Lanes marked "+/+" *nd "+/-" contain 
homozygous wild types and heterozygotes respectively. the 
right-hand lane contains a 100 bp ladder. 

nBT&TT.ED DESPPTPTTON THF TNVENTION 

10 The cytochrome P450 isozyme gene, CYP2C9 encodes 

a high affinity hepatic [S]-warfarin 7 -hydroxylase which 
appears to be principally responsible for the metabolic 
clearance of the most potent enantiomer of warfarin along 
with metabolizing a number of other drugs and chemical 
15 carcinogens. Similarly, the cytochrome P450 isozyme gene, 
CYP2A6, encodes a protein that metabolizes nicotine, 
coumarin and a host of other drugs and chemical 
carcinogens CYP2A6 also activates the tobacco-specific : 
nitrosamine 4- (methylnitrosamino) -1- ( 3-pyridyl) - 
20 i-butanone (herein referred to as "NNK" ) . Many cancers 
have been associated with activation and/or accumulation 
of nitrosamines. The present invention allows detection of 
a predisposition to such cancers. 

It is of note that the above gene products are 
25 also known to metabolize other substrates. For example, 
the CYP2C9 gene product is also known to metabolize 
Tolbutamide, Phenytoin, Ibuprofen, Imipramine, Naproxen, 
Tienilic acid, Diclofenac and Tetrahydrocannabinol and 
hence can also be used to detect sensitivities to these 
30 drugs. A list of CYP2C9 drug substrates has been 
documented and is incorporated herein by reference 
(Gonzalez & Idle 1994 Clin. Pharmacokinet 26:59-70). 
Hence, the present invention can be used to screen for 
sensitivities to these drugs. 
35 In addition, CYP2C9 has been associated with the 

metabolism of chemical carcinogens, such as polycyclic 
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aromatic hydrocarbons. For example, the most ubiquitous 
environmental carcinogen, benz- [a] -pyrene is metabolized 
by CYP2C9 . Benz- [a] -pyrene is found in tobacco, barbecued 
meats, car exhaust and generally, in polluted air. This 
compound, as it accumulates in the body becomes a potent 
DNA intercalating agent, ultimately resulting in cell 
transformation and the formation of tumors. The present 
invention provides a diagnostic method of screening 
individuals for their ability to metabolize and hence 
inactivate benz- [a] -pyrene. For example, a homozygote 
wild-type CYP2C9 individual would be better able to 
tolerate high levels of benz- [a] -pyrene than a 
heterozygote of the CYP2C9 allele. 

Similarly, the CYP2A6 allele is associated with 
drug sensitivity and carcinogen metabolism. Coumarin 
sensitivity is directly related to the presence of a 
variant CYP2A6 allele, such as CYP2A6vl, CYP2A6V2 and also 
CYP2A13. Coumarin is a drug used in treatment of 
neoplastic diseases, such as lymphomas. (See Martindale: 
The Extra Pharmacopoeia 1993 Ed. Reynolds, J.E.F., The 
Pharmaceutical Press, London, p. 1358). Its suggested 
dosage is very high. Therefore, the present invention is 
useful in determining a patient's sensitivity to the drug 
in order to prescribe a proper dosage and avoid toxicity. 

Another drug, Thiotepa* , is used in the 
treatment of a variety of neoplastic diseases, such as in 
treating women with breast cancer and children with brain 
tumors. Thiotepa is metabolized by CYP2A6 into Tepa, 
which is an intermediate more therapeutically potent than 
Thiotepa. Therefore, if a patient has a very active 
CYP2A6 enzyme, it is likely the patient will require lower 
doses of Thiotepa to provide a therapeutically effective 
amount. As one can see, the dosage provided to a patient 
is dependent upon the rate a patient is capable of 
metabolizing activating the drug. The present invention 
35 has identified variant alleles whose enzymatic activity is 
compromised. In addition, the present invention provides 
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a simple method of genotyping patients for Thiotepa drug 
sensitivity. With information concerning patient 
sensitivity to such drugs, the proper dosage can be 
provided, hence maximizing drug efficiency and minimizing 

drug toxicity. 
^ Further, CYP2A6 has been associated with 

nicotine metabolism. In addition to being an active 
ingredient in tobacco, nicotine also has several clinical 
uses. Nicotine is used clinically to treat various 
neurological disorders, such as Parkinson's disease and 
10 Alzheimer's disease. In addition, nicotine is used to 

treat tobacco addiction. In all of these situations, it 
is important to know a patient's sensitivity to nicotine, 
since extremely sensitive patients will become violently 
ill upon administration of nicotine. Therefore the 
15 present invention provides a method of identifying 

nicotine-sensitive patients by genotyping a patient's ... 
CYP2A6 allele. The present invention also provides a 
convenient method for determining an individual's general 
predisposition to using tobacco based upon their 
20 sensitivity to nicotine. 

In addition, CYP2A6 is involved in activating 
nitrosamines, thereby producing the potent carcinogen NNK. 
Increased levels of NNK have been associated with a 
variety of cancers, including but not limited to lung 
25 cancer, nasal-pharynx cancers, throat cancers and colon 
cancers. In general, elevated levels of CYP2A6 has been 
associated with cancers associated with exposure to 
nitrosamines. The present invention may detect a 
patient's predisposition to such cancers. The presence of 
30 a CYP2A6 gene or a variant thereof will affect the 

likelihood that procarcinogens present in tobacco smoke 
will be activated into carcinogenic nitrosamines and 
nitrosamine-derivatives and therefore result in the 

development of a cancer. 
35 it follows that genetic polymorphisms or 

mutations in either of the two aforementioned genes can 
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lead to an impairment in metabolism of at least the 
aforementioned drugs and chemical carcinogens. 

The present invention relates to the 
identification of the absence or presence of mutations in 
CYP2C9 and CYP2A6 and thus predict the phenotype of an 
^ individual and so predict whether and how an individual is 
likely to metabolize particular drugs and chemical 
carcinogens. For instance, the R144C mutation arising 
from a C 472 -*T base substitution in the CYP2C9 gene results 
in a reduction in warfarin metabolism. This implies that 

^ patients with this mutation receiving warfarin require a 
lower dose to maintain an anticoagulation target than 
those patients who do not have the mutation and are also 
receiving warfarin. Conversely, homozygous wild-types 
require higher doses in order to maintain an 

^ anticoagulation target. 

"Mutation", as the term is used herein denotes 
an allelic variation of a known sequence, which alters the 
expressed gene product's activity. Such a variation need 
not completely inactivate the gene product's activity but 

20 merely alter it. 

Similarly, one mutation within CYP2A6vl arising 
from a T 4S8 -*A base change results in substitution of 
Leucine 160 by Histidine. Another CYP2A6 variant, 
CYP2A6V2, has been identified which differs from CYP2A6 in 

25 the regions of exons 3, 6 and 8. One particular mutation 
in CYP2A6v2, T 415 ->A mutation is useful in the assay of the 
present invention. These substitutions are very useful in 
detecting predispositions to cancers associated with 
tobacco and activation of nitrosamines . The normal CYP2A6 

30 enzyme functions in the metabolism of nicotine, one of the 
carcinogenic compounds in tobacco. 

In addition, the present, invention relates to 
the identification of a new variant of CYP2A6 designated 
CYP2A6V2. The variations of CYP2A6v2 from CYP2A6 bear 

35 sequence relatedness with the corresponding exons of the 

CYP2A7 gene, suggesting a recent gene conversion. The cDNA 
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and genomic sequence for this gene is provided in the 
present invention. Hence, at least three different 
allelic variants of CYP2A6 exist and are illustrated in 
Figure 8. These allelic variants include CYP2A6, CYP2A6vl 
and CYP2A6V2. 

5 Further, the present invention relates to a new 

CYP2A gene, designated CYP2A13. This gene produces an 
inactive form of CYP2A6, however variants at particular 
positions, including amino acid positions 117, 209 and 365 
produce an enzyme which may alter the enzyme's activity 
10 and hence affect drug sensitivity. These mutations in 
CYP2A6 are likely to result in a deficiency or impaired 
activity. of one of the enzymes responsible, for example, 
for metabolizing drugs, nicotine and nitrosamines . 

CYP2A13 is considered a new cytochrome P450 
15 gene. However, since the CYP2A13 gene product has a 

similar function as the CYP2A6 , it is discussed herein as 
a variant of CYP2A6 . That is, assays using the specific 
mutated amino acid positions 117, 209 and 365 of CYP2A13 
and detecting variations at those positions are indicative 
20 of CYP2A6-like variant functions. 

In one embodiment, the CYP2A6v2 or CYP2A13 
proteins or functional portions thereof are expressed as 
recombinant genes in a cell, so that the cells may be 
transplanted into an individual in need of gene therapy 
25 due to the predisposition to a carcinogen-associated 
cancer or a sensitivity to a drug. To provide gene 
therapy to an individual, a genetic sequence which encodes 
for all or part of the CYP2A6v2 or CYP2A13 ligands are 
inserted into vectors and introduced into host cells. 
30 Examples of vectors that may be used in gene therapy 

include, but are not limited to, defective retroviral, 
adenoviral, or other viral vectors (see, e.g., Mulligan, 
R.C., 1993, Science , 260:926-932). The means by which the 
vector carrying the gene may be introduced into the cell 
35 includes, but is not limited to, microinjection, 

electroporation, transduction, or transfection using DEAE- 
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dextran, lipofection, calcium phosphate or other 
procedures known to the skilled routineer (see, e.g., 
Sambrook et. al. (Eds.), 1989, In "Molecular Cloning. A 
Laboratory Manual", Cold Spring Harbor Press, Plainview, 
New York) . Examples of cells into which the vector 
carrying the gene may be introduced include, but are not 
limited to, continuous culture cells, such as COS, 
NIH/3T3, and primary or culture cells of the relevant 
tissue type. 

More specifically, there is provided a method of 
enhancing the therapeutic effects of blood cells, that are 
infused in a patient, comprising: (i) inserting into the 
blood cells of a patient a DNA (RNA) segment encoding 
CYP2A6V2 or CYP2A13 gene product that enhances the 
therapeutic effects of the blood cells; and (ii) 
introducing cells resulting from step (i) into the patient 
under conditions such that the cells resulting from step 
(i) "target" to a tissue site. In the alternative, as 
previously described the cells are not "targeted" and 
functions as a systemic therapeutic. The genes are 
inserted in such a manner that the patient's transformed 
blood cell will produce the agent in the patient's body. 
In the case of antigen-specific blood cells which are 
specific for an antigen present at the tissue site, the 
specificity of the blood cells for the antigen is not lost 
when the cell produces the product. 

Alternatively, as hereinabove indicated, 
CYP2A6V2 or CYP2A13 DNA (RNA) may be inserted into the 
blood cells of a patient, in vivo, by administering such 
DNA (RNA) in a vehicle which targets such blood cells. 

Further details regarding methods of gene 
therapy are provided in Anderson et al., U.S. Patent No. 
5,399,34 3 which is herewith incorporated herein by 
reference . 

In another embodiment, antisense CYP2A6v2 or 
35 CYP2A13 DNA or RNA may be used to control the expression 
of CYP2 gene. For example, antisense therapy may be used 
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to control CYP2A6's ability to activate dangerous 
nitrosamines by curbing its expression. Methods of 
producing such antisense molecules are described in U.S. 
Patent No. 5,190,931, which is incorporated herein by 
reference. 

Developing a genotyping assay, which could 
distinguish the CYP2 genes of interest from other 
cytochrome P450 genes required careful engineering since 
these genes have a high degree of sequence homology. To 
overcome this problem, one embodiment of the present 
invention has elucidated the genomic sequence structure of 
CYP2C9 and CYP2A6 with a view to making, in part, intron 
specific primers. That is to say primers which, in part, 
hybridize to at least one intron, preferably an intron 
adjacent to an exon including the mutation of interest, in 
15 the gene to be examined. Since there is less homology 

between the introns of cytochrome P4 50 genes, it has been 
found that using intron specific primers, gene specific 
assay can be undertaken. The present invention has a- 
further advantage of using intron specific primers in .so 
20 far as the use of such primers facilitates the manufacture 
of an optimum length of DNA which in turn facilitates -the 
specificity of the instant bioassay. 

A "genotyping" assay as the term is used herein 
refers to any diagnostic or predictive test to detect the 
25 presence or absence of allelic variants of a known gene 

sequence at a specified gene locus. Two gene loci are of 
particular interest in the present invention, CYP2A6 and 
CYP2C9. 

Further, the present invention relates to 
30 differences between the genomic DNA sequence structure and 
the cDNA sequence structure, as illustrated in Figure 7. 
As a result, primers directed at the genomic sequence 
structure have been developed which are more reliable. 

Several methods are provided for identifying the 
35 presence or absence of a mutation at codon 144 of the 

coding sequence of CYP2C9, or alternatively, at codon 160 
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of the coding sequence of CYP2A6, or alternatively, a gene 
conversion event involving CYP2A6 and CYP2A7 in exons 3, 6 
or 8 comprising a DNA encompassing the region of a CYP2 
gene unique to that variant. 

One such method relates to an assay which 
contemplates the use of one specific primer which 
specifically encompasses the region containing the 
mutation, and a second primer which is complementary to 
another portion of the gene. The second primer sequence 
chosen is based upon the CYP2A6, CYP2C9 or CYP2A13 
sequences as set forth in figures 12, 1 and 14, 
respectively, depending upon the preferred size of the 
amplification product. One skilled in the art will know 
how to select second primer based on the region of gene 
chosen for amplification. These primers need not be 
identical to a given sequence but must be sufficiently 
complementary to hybridize to the target region in a 
specific manner. In short, the primers are preferably at 
least substantially homologous to the nucleic acid 
sequence provided ♦ 

Nucleic acid sequences includes, but is not 
limited to, DNA, RNA or cDNA. Nucleic acid sequence as 
used herein refers to an isolated nucleic acid sequence. 
Substantially homologous as used herein refers to 
substantial correspondence between the nucleic acid primer 
sequence of as described herein and that of any other 
nucleic acid sequence. Substantially homologous means 
about 50-100% homologous homology, preferably by about 70- 
100% homology, and most preferably about 90-100% homology 
between the particular sequence discussed and that of any 
30 other nucleic acid sequence. 

In the instant application, the term "primer" is 
further used to designate a molecule comprising at least 
three nucleotides, the exact length being determined by 
the requisite amount of DNA needed, under given reaction 
35 conditions, to bind to or interact with a test sample so 
as to identify the presence or absence of either of said 
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mutations. Preferably, the primer is usually between 15 
and ideally about 20 to 50 oligonucleotides in length. 

The primer is selected, or adapted, to be 
substantially complementary to a part of DNA which is 
adjacent to the region of at least one of the 
aforementioned mutations. Thus such a primer is able is 
hybridize with a part of DNA that contains a region in 
which the mutation of interest may be found. Although the 
primer may not reflect the exact sequence of the region in 
which the mutation is thought to occur, the more closely 
the primer is to this sequence, then the better the 
binding will be. Ideally, the more closely the sequence 
of the 3' end of the primer is to said region the better 
the binding or interaction will be. 

An alternative method for using the sequence 
15 unique to a variant for detection relates to use of an 
oligonucleotide probe for specifically detecting the , 
presence or absence of a CYP2 variant gene in a sample, 
this method comprises the steps of contacting the sample 
with a nucleic acid probe, allowing hybridization, forming 
20 a probe: CYP2 variant complex; washing excess probe from 
probe: CYP2 variant complex; and detecting probe: CYP2 
variant complex, wherein a positive signal is an 
indication of the presence of the CYP2 variant in the 
sample . 

25 The hybridization of the probe to sample nucleic 

acids can be carried out by any of the methods commonly 
used in the art. Such methods include but are not limited 
to, Dot blot, Colony hybridization, Southern blot, 
solution hybridization and in situ hybridization. 
30 washing the excess probe from the probe: CYP2 

variant DNA can be accomplished by many well-known 
methods. Simply rinsing the complex with excess buffer 
will facilitate removal of excess probe; Alternatively, 
washing may entail separating the probe: CYP2 variant 
complex from excess probe. Many methods are known to one 
skilled in the art and include but are not limited to 
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centr if ligation, filtration and magnetic force. 

According to the present invention there is 
provided a portion of DNA suitable for use as a primer in 
a method for identifying the presence or absence of a 
mutation either at codon 144 of the coding sequence of the 
^ gene CYP2C9, or alternatively, at least one gene 

conversion event involving CYP2A6 and CYP2A7 in exons 3, 6 
or 8 , or alternatively, at codon 160 of the coding 
sequence of the gene CYP2A6; comprising a DNA which is 
adapted to hybridize to at least one intron of at least 

^ one of said genes . 

In one embodiment, the method comprises the use 
of at least one restriction endonuclease to digest DNA 
from individuals to be tested. In this instance, DNA from 
individuals positive for the wild-type form of CYP2C9 

15 provide a digest with a restriction endonuclease, such as 
Avail results in production of two fragments, a first 
fragment including 270 base pairs and a second fragment 
including 50 base pairs* In contrast, individuals having 
the aforementioned mutation in CYP2C9 present a single 

20 fragment of 320 base pairs only. This is due to a loss of 
the Avail site. The CYP2A6 gene variants can also be 
distinguished by the occurrence of specific restriction 
endonuclease sites. The CYP2A6vl variant, which is a 
T A8S ->A mutation in exon 3 can be identified by a variant- 

25 specific Xcml restriction site. The CYP2A6v2 variant, 
which contains a C 415 -*A mutation within exon 3 can be 
identified by a variant-specific Ddel restriction site. 
The wild-type CYP2A6 gene does not contain either an Xcml 
or Ddel site. The results of such restriction 

30 endonuclease digestions are illustrated in Figure 9. 

It may be necessary to amplify the DNA prior to 
digestion. Such may be the case when the DNA of interest 
is present in minute quantities in a sample. In such 
circumstances, amplification of DNA to be tested is 

35 undertaken before digesting the DNA as described above. 

This provides for a greater quantity of materials. 
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Amplification is performed using any conventional 
technique, such as by a PCR reaction. Many other 
techniques for amplification can be used in producing 
sufficient DNA for detections. Such amplification 
techniques are well-known to the skilled artisan and 
include, but are not limited to polymerase chain reaction 
(PCR), PCR in situ, ligase amplification reaction (LAR) , 
ligase hybridization, QB bacteriophage replicase, 
transcription-based amplification system (TAS) , genomic 
amplification with transcript sequencing (GAWTS) and 
nucleic acid sequence-based amplification (NASBA) . A 
general review of these methods is available in Landegren, 
et al., Science 242:229-237 (1988) and Lewis, R. , Genetic 
Engineering News 10:1, 54-55 (1990), which is incorporated 
herein by reference. 
15 one embodiment of the present invention uses.-. 

oligonucleotide primers in an amplification and detection 
assay. A basic description of nucleic acid amplification 
is described in Mullis, U.S. Patent No. 4,683,202, which 
is incorporated herein by reference. The amplification 
20 reaction uses a template nucleic acid contained in a ,« 
sample, two primer sequences and inducing agents. The 
extension product of one primer when hybridized to the 
second primer becomes a template for the production of a 
complementary extension product and vice versa, and the 
25 process is repeated as often as is necessary to produce a 
detectable amount of the sequence. 

The inducing agent may be any compound or system 
which will function to accomplish the synthesis of primer 
extension products, including enzymes. Suitable enzymes 
30 for this purpose include, for example, E.coli DNA 

polymerase I, thermostable Tag DNA polymerase, Klenow 
fragment of E.coli DNA polymerase I, T4 DNA polymerase, 
other available DNA polymerases, reverse transcriptase and 
other enzymes which will facilitate combination of the 
35 nucleotides in the proper manner to form amplification 
products . 
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A sample being screened for the presence or 
absence of a mutation in CYP2A6 and/or CYP2C9 genes can be 
tested with the instant invention. The nucleic acid 
material can be in purified or nonpurified form, provided 
the sample contains the CYP2A6 and/or CYP2C9 genes. The 
5 sample may be derived from any tissue or bodily fluid, 
wherein the patient's DNA can be found. A clinically 
practical type of sample is a blood specimen which 
contains patient DNA and can conveniently be genotyped in 
the bioassay of the present invention. 

1° The "primers", as the term is used in the 

present invention refers to an oligonucleotide, whether 
occurring naturally as in a purified restriction digest or 
produced synthetically, which is capable of acting as a 
point of initiation of synthesis when placed under 

15 conditions wherein synthesis of a primer extension product 
which is complementary to a nucleic acid strand is 
induced, i.e. in the presence of nucleotides and an 
inducing agent such as DNA polymerase and at a suitable 
temperature and pH. The primers are preferably single 

20 stranded for maximum efficiency in amplification, but may 
alternatively be double stranded. If double stranded, the 
primer is first treated to separate its strands before 
being used to prepare amplification products. Preferably, 
the primers are oligodeoxyribonucleotides . The primers 

25 must be sufficiently long to prime the synthesis of 

extension products in the presence of the inducing agent. 
The exact lengths of the primers will depend on many 
factors, including temperature, source of primer and use 
of the method. For diagnostic methods, the primers 

30 typically contain at least 10 or more nucleotides. The 

oligonucleotide primers may be prepared using any suitable 
method, such as, for example, the phosphotriester and 
phosphodiester methods (Narang, S.A., et al., Meth. 
Enzymol. 68:90 (1979); Brown E.L., et al., Meth. Enzymol., 

35 68:109 (1979)) or automated embodiments thereof. . In one 

such automated embodiment diethylphosphoramidites are used 
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as starting materials and may be synthesized as described 
by Beaucage et al. f Tetrahedron Letters 22:1859-1962 
(1981). One method for synthesizing oligonucleotides on a 
modified solid support is described in U.S. Pat. No. 
4,458,066. It is also possible to use a primer which has 
5 been isolated from a biological source (such as a 
restriction endonuclease digest) . 

In a genotyping bioassay of the present 
invention, one embodiment comprises a gene-specific 
amplification reaction, an exon-specif ic amplification 
10 reaction and a restriction endonuclease reaction. In such 
a reaction a suitable polynucleotide polymerase is used in 
the amplification reaction, many of which have already 
been described in the art. In addition, any appropriate 
restriction endonuclease which is designed to digest the 
15 DNA and so provide information concerning genotype may be 
used. 

It may further be necessary to provide a label 
on the nucleic acid for detection. The nucleic acid, can 
be DNA or RNA and made detectable by any of the many 
20 labeling techniques readily available and known to the 
skilled artisan. Such methods include, but are not 
limited to, radio-labelling, digoxygenin-labeling , and 
biotin-labeling. A well-known method of labeling DNA is 
32 P using DNA polymerase, Klenow enzyme or polynucleotide 
kinase. In addition, there are known non-radioactive 
techniques for signal amplification including methods for 
attaching chemical moieties to pyrimidine and purine rings 
(Dale, R.N.K. et al. 1973 Proc. Natl. Acad. Sci. USA, 
70:2238-2242; Heck, R.F. 1968 S. Am. Chem. Soc. , 90:5518- 
5523), methods which allow detection by chemiluminescence 
(Barton, S.K. et al . 1992 J. Am. Chem. Soc, 114:8736- 
8740) and methods utilizing biotinylated nucleic acid 
probes (Johnson, T.K. et al. 1983 Anal . Biochem. , 133:125- 
131; Erickson, P.F. et al. 198 2 J. of Immunology Methods, 
51:241-249; Matthaei, F.S. et al 1986 Anal. Biochem., 
157:123-128) and methods which allow detection by 
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fluorescence using commercially available products. Non- 
radioactive labelling kits are also commercially 
available. Such a label can readily be incorporated into 
the nucleic acid during an amplification step. In the 
absence of an amplification step, a target nucleic acid 
can readily be chemically or enzymatically modified to 
carry a label. Additionally, it may be preferable to 
provide a labeled primer which may serve to incorporate a 
label into the nucleic acid target. Probes, as may be 
used in an embodiment of the invention may also be 
chemically or enzymatically labeled as described above. 

In a preferred embodiment of the invention said 
DNA primer hybridizes to an intron adjacent said position 
of said mutation. Preferably said DNA is a primer with 
the 3 '-end specific for the gene of interest. Preferably 
further still said DNA is single stranded. Preferably 
further still, in so far as the CYP2C9 mutation is 
concerned, said primers are as follows: 

HF18: position 8 of intron 2 onwards of genomic 
sequence in forward orientation comprises 
5' TGCAAGTGCCTGTTTCAGCA 3' 

HF2R: position 505 onwards of cDNA sequence in 
reverse orientation comprises 
5 ' AGCCTTGGTTTTTCTCAACTC 3 ' . 

It is of note that both these primers are 
designed to be specific for CYP2C9 and so do not amplify 
related genes such as CYP2C8, which notably also has an 
Arginine 144 present . 

Preferably, in so far as CYP2A6 is concerned, 
three primers J51, J61 and B are used in two parallel 
allele-specif ic PCR reactions. These primers are as 
follows: 

35 . J51 comprises 5' GGCTTCCTCATCGACGCACT 3' 

(forward strand from position 479 of cDNA 
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sequence described as hIIA3 (Yamano, et al. 1990 
Biochem 29:1322-29)). 

J61 comprises 5' GGCTTCCTCATCGACGCACA 3' 
(forward strand from position 479 of cDNA 
sequence described as hIIA3v (Yamano, et al. 

5 1990 Biochem 29:1322-29)). 

Both J51 and J61 contain a substitution at 
position 18 of A for C to give improved 
specificity as suggested by Newton et al (1989 
Nuc. Acids Res. 17:2503-2516). 

10 Primer B comprises 5' AATTCCAGGAGGCAGGGCCT 3' 

(reverse orientation from position 125 of intron 
3 of CYP2A6 (onwards) . Designed so that only 
CYP2A6 and not CYP2A7 or CYP2A12 are amplified. 

15 one method of genotyping CYP2A6 provides an 

allele-specif ic amplification reaction method is used. , In 
this instance, DNA which is adapted to specifically 
hybridize to the wild-type or the mutant type of the gene 
is incubated with test DNA under reaction conditions and 

20 the resultant products are analyzed by electrophoresis and 
then visualized by staining with ethidium bromide. 
Individuals who are homozygous for the wild-type allele 
produce a reaction product with primer J51 only. 
Similarly, individuals who are homozygous for the mutation 

25 produce a reaction product with primer J61 only. Those 
individuals who are heterozygous produce a reaction 
product with both J51 and J61. 

Alternatively, another method for genotyping 
CYP2A6 is provided in a specific amplification bioassay, 

30 which is achieved with primers F4 and R4 as follows: 

The F4 primer (forward) comprises 

5 ' CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCAGGATT 
CATGGTGGGGCA 3', wherein a preferred fragment 
35 thereof further comprises 

5 ' CCTCCCTTGCTGGCTGTGTCCCAAGCTAGGC 3 ' . 
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The R4 primer (reverse) comprises 

5' GCCACCACGCCCCTTCCTTTCCGCCATCCTGCCCCCAGTCTTAGC 
TGCGCCCCTCTC 3 ' , wherein a preferred fragment 
thereof further comprises 

5 ' CGCCCCTTCCTTTCCGCCATCCTGCCCCCAG 3 ' . 

5 

This method of CYP2A6 genotyping involves a 
first amplification reaction with F4 and R4 primers, which 
generates a DNA fragment approximately 7.8 kb in size. 
' This amplification step is facilitated by polymerases 
10 which are capable of transcribing long stretches of DNA. 

To distinguish the CYP26Avl and CYP26Av2 variant alleles, 
an exon-specif ic amplification step is carried out using 
the 7.8 Kb DNA fragment as template DNA. This may be 
accomplished using the following primer pair: 

The E3F primer (forward) comprises 

5 ' CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCGCCAAG 
CAGCTCCTG 3', wherein a preferred fragment 
thereof further comprises 
20 5 ' GCGTGGTATTCAGCAACGGG 3 ' . 

The E3R primer (reverse) comprises 

5 ' CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCCCCCGC 
ACTCGGGATGCG 3', wherein a preferred fragment 
thereof further comprises 
25 5 / TCGTCCTGGGTGTTTTCCTTC 3 ' . 

Using these primers in .a second amplification 
reaction step a segment of CYP2A6 exon 3 is specifically 
amplified. The method further comprises use of the 

30 restriction endonuclease Xcml to detect the CYP2A6vl 
mutation and Ddel to detect the CYP2A6v2 mutation. 

According to a yet further aspect of the 
invention there is provided a kit for performing the afore 
described methods which kit includes at least a portion of 

35 DNA in accordance with the invention and preferably at 

least one control sample of DNA containing the mutation or 



WO 95/34679 



PCT/US95/07605 



- 25 - 

° mutations of interest and ideally also a wild-type sample 
of DNA so that suitable comparisons can be made. 

It is of note that although the method is 
described with reference to the above methods, any 
suitable method using the genetic material of the 

5 invention may be used to identify the mutations descried 
herein. 

The CYP2C9 assay has been used in a study ot 
warfarin dose requirement in 94 patiertts undergoing 
anticoagulant treatment and the results obtained are 
10 summarized in Figure 5. 58 patients (61.7%) were 

homozygous for the wild-type (Arg 1u ) allele and were found 
to require a median weekly maintenance dose of 31.5 mg of 
warfarin. 3 6 patients (38.6%) were heterozygous and 
required a median weekly maintenance dose of 24.5 mg. The 
15 doses required by the two groups were significantly 

different (Mann-Whitney U-test, P = 0.016). No subjects 
in the group were homozygous for the mutant allele but 
based on allele frequencies and the Hardy Weinberg . 
equilibrium, the predicted frequency of homozygous mutant 

20 subjects is 3.7%. 

comparison of the weekly maintenance dose of 
warfarin in the R144C heterozygotes (n = 36) and 
homozygous wild-type (n = 58) reveals that the 
heterozygotes required a significantly lower dose (range 
of 10.5 - 80. mg). Moreover, of the patients requiring the 
lowest doses to maintain an anticoagulation target (INK 
2 0-4.0) , in the range 5-15 mg per week, 9 out of 10 were 
heterozygous. At the other extreme of weekly doses >55 
mg, 5 out of 6 patients were homozygous wild-type for 
CYP2C9 The significantly lower (20%) warfarin dose 
requirement of the patients with one variant R144C allele 
is consistent with the kinetic properties of the R144C 
protein with respect to (S) -warfarin hydroxylation and 
presumed in vivo metabolic clearance (Rettie et al. 1994 

35 Pharmocogen. , 4:39-42). 

The CYP2A6 genotyping assay has been used in 
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studies on coumarin metabolism. Coumarin 7-hydroxylase 
activity is a convenient marker activity to identify the 
presence of CYP2A6 in a particular sample. There is 
considerable variation in the ability of individuals to 
7-hydroxylate this compound which is a reaction specific 
for CYP2A6 • A subject deficient in coumarin 
7-hydroxylation has been identified. This subject is 
homozygous for the mutant CYP2A6vl allele confirming the 
previous in vitro findings that substitution of LeuI60 by 
His results in loss of coumarin 7-hydroxylase activity. 
As shown in Fig. 6, CYP2A6 genotyping and phenotyping with 
coumarin has been performed on other members of the 
proband's family and impaired coumarin 7-hydroxylation has 
been observed in heterozygotes for the CYP2A6vl mutation. 

The genotyping assays described herein resulted 
from a two step amplification reaction wherein first 
amplification reaction amplifies a 7.8 Kb fragment 
containing the CYP2A6 gene (Fig. 9A) and a second 
amplification reaction amplifies an exon-specif ic fragment 
of CYP2A6. The amplification product was digested with 
restriction endonucleases producing different patterns for 
the various CYP2A6 alleles. Representative results 
obtained for several human subjects for the detection of 
the CYP2A6V1 (Xcml digestion) and CYP2A6V2 (Ddel 
digestion) are shown in Figure 9 panel B. A schematic 
depiction of this genotyping assay is shown in Figure 9, 
panel C. Of 155 human genomic DNA samples analyzed 21 
heterozygous (+/-) and 6 homozygous (-/-) subjects were 
detected for the CYP2A6vl allele, whereas 17 heterozygous 
(+/-) and no homozygous were identified for the CYP2A6v2 
30 allele variant. Additionally, 7 homozygous for both 

CPYP2A6vl and CYP2A6v2 alleles were found. 

Allelic frequencies were calculated for either 
allele in several ethic groups and analyzed as shown in 
Table l. CYP2A6vl frequency is almost identical between 
Caucasian and Japanese, and it is only twice the frequency 
in Taiwanese samples. Significantly, this allele is 
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completely absent in the African-American population 
within the samples studied. The Japanese population has a 
remarkable higher frequency for the CYP2A6v2 allele (28%) 
as compared to the Caucasian (2%), Taiwanese (6%) or 
African-American (2.5%) (ethnic groups) . 

Table l : Allelic frequency for the CYP2A6 gene in 
different ethnic groups. 

Allelic Frequencies (%) 



Ethnic Group 


CYP2A6 


CYP2A6V1 


CYP2A6V2 


N 


Caucasian 


75 


23 


2 


52 


Japanese 


52 


20 


28 


40 


Taiwanese 


83 


11 


6 


178 


African-American 


97,5 


0 


2-5 


40^ 



The following examples illustrate various 
aspects of the present invention and in no way are 
intended to limit the scope thereof. All books, articles, 
and patents referenced herein are incorporated herein,- in 
toto, by reference. Other similar embodiments will be 
clear to the skilled artisan and are encompassed within 
the spirit and purview of the present invention. 

EXAMPLE 1 

Method for determining th e genotype CYP2C9 

Genotyping for the CYP2C9 polymorphism is 
carried out by amplification by PCR followed by digestion 
with the restriction endonuclease Avail. Amplifications 
are performed in 0.5 ml microcentrifuge tubes in a volume 
of 100 /il containing 10 mM Tris-HCl, pH 8.8, 1.5 mM MgCl2. 
50 mM KC1, 0.1% Triton X-100, 5% dimethylsulphoxide, 200 
M M each of dTTP, dATP, dCTP and dGTP, 250 fiVL of the 
primers HF18 and HF2R, 2.5 units Taq polymerase and 1 /x9 
human leukocyte genomic DNA. PCR conditions consist of 3 5 
cycles with a denaturation at 93 *C for 1 min. annealing at 
55-C for 1.5 min and polymerization at 72'C for I min. 20 
jil of the amplified DNA is incubated with 10 units Avail 
for 3h at 37 °C and then analyzed by electrophoresis on 
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1.8% agarose minigels in TBE (90 mM Tris-borate, 2 mM 
EDTA) buffer. The digestion products are visualized by 
ethidium bromide staining. DNA from individuals positive 
for the wild-type Arg,44 is digested to give fragments of 
270 bp and 50 bp whereas in individuals with the mutant 
C Y S H4 Present, a band of 3 20 bp is seen due to loss of an 
Avail site (Figure 3). 

EXAMPLE 2 

Genotyping for the CYP2C9 polymorphism was 
carried out by amplification by PCR followed by digestion 
with the restriction endonuclease Avail. 

One hundred patients were recruited from two 
anticoagulation clinics in the Newcastle area over four 
study days. Body weight and height were measured, the 
basal metabolic index ("BMI") calculated for each patient 
and details of age, sex, drug history, current and 
previous International . Normalized Ratio ("INR") 
determinations, indications for anticoagulation and other 
significant health problems were all recorded. DNA was 
isolated by a standard manual, chloroform-phenol extraction 
procedure and l^g was subjected to PCR analysis. As shown 
in Figure 10 the C->T substitution, which converts Arg-+144 
to Cys, resides in exon 3 of the CYP2C9 gene and results 
in the loss of an Avail restriction site 
( . . . GAGGACCGTGTTCAA . . . ) in the R144C allele 
( . . . GAGGACTGTGTTCAA. . . ) . This provided the basis of the 
amplification strategy. A CYP2C9 specific intron forward 
primer (HF18, TGCAAGTGCCTGTTTCAGCA, Figure 10) and a 
CYP2C9 exon 3 3 '-end reverse primer (HF2R, 
AGCCTTGGTTTTTCTCAACTC, Figure 10) were used at a 
concentration of 250/iM each. Amplifications were 
performed in a volume of 100 /xl containing 20 mM Tris HC1 
(pH 8.3), 1.5 mM MgCl 2 , 25 mM KC1, 0.05% (w/v) Tween 20, 
10 fxg gelatin/ml, 2% (w/v) DMSO, 200. /zM each of dATP, 
dCTP, dGTP and dTTP and 2.5 units of Taq DNA polymerase 
(Perkin-Elmer) . Reactions were carried out for 35 cycles 
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at an annealing temperature of 55 °C for 90 sec, a 
polymerase temperature of 72 'C for 1 min, and a heat 
denaturing temperature of 9 3 *C for 1 min, using a Perkin- 
Elmer Cetus DNA thermal cycler. The PCR products digested 
with Avail and sized using NuSieve agarose gels (3% 

5 NuSieve, 0.75% agarose). Presence of the CYP2C9 wild-type 
and R14 4C alleles were detected as fragments of 50 + 27 0 
bp and 320 bp respectively (see Figures 3). The PCR 
product synthesized from human genomic DNA with the 
primers HF18/HF2R was directly sequenced on an ABI 373A 

10 automatic sequencer. Briefly, the PCR product was first 
purified by using the Wizard DNA clean-up system (Promega 
Co., Madison, WI) . The purified template was then 
subjected to dideoxy terminator cycle-sequencing with the 
primers HF18 and HF2R. The primer-extended products were 

15 purified and sequenced following the manufacturer's 
procedure. Sequence analysis was done by using the 
MacVector software program (Eastman-Kodak Co., Rochester, 
NY) . 

DNA was obtained from 94 patients. Of these, 58 

20 (62%) were homozygous for the wild-type. CYP2C9 gene and 36 
(38%) were heterozygous for the R144C allele. No R144C 
homozygotes were found. The frequency of the wild-type 
(Arg-144) and R144C (Cys-144) alleles in the study 
population is thus 0.8 08 and 0.19 2 respectively. An 

25 expectation of 3.7% R144C . homozygotes can be anticipated 
from the Hardy-Weinberg equilibrium, but the 95% 
confidence interval in this estimation of 0.8-8.4% and 
thus the finding of zero homozygotes in 94 patients is not 
significantly different from expectation. The specificity 

30 of the PCR reaction with respect to the CYP2C9 gene was 
confirmed by sequencing. The alignment of the sequence 
obtained from the PCR product with that corresponding to 
the CYP2C9 gene showed a 100% degree of homology. 
Interestingly, a heterozygous pattern was obtained for the 

35 R144C allelic variant, confirming the high frequency of 
this allele within the normal population. No sequence 
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deriving from CYP2C9, CYP2C18 or CYP2C19 was found 
confirming the specificity of the assay for CYP2C9 • 
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EXAMPLE 3 

Method for determining the genotype CYP2A6 

Genotyping for the CYP2A6 polymorphism is 
carried out by allele-specif ic PCR using two parallel PCR 
reactions, one specific for the wild-type allele, one for 
the mutant allele. Amplifications are performed in 0.5 ml 
microcentrifuge tubes in a volume of 45 fil containing 10 
mM Tris-HCl, pH 8.8, 1.5 eM MgCl 2 , 50 mM KCl, 0.1 % Triton 
X-100, 5 % dimethylsulf oxide, 200 /xM each of dTTP, dATP, 
dCTP and dGTP, 250 ^M of the primers B and either J51 or 
J61, 1.25 units Taq polymerase and 1 fig human leukocyte 
genomic DNA* PCR conditions consist of 40 cycles with a 
denaturation at 93°C for 1 min. , annealing at 57°C for 2 
min and polymerization at 70 °C for 2 min. The products 
are analyzed by electrophoresis on 1% agarose minigels in 
TBE buffer and DNA is visualized by staining with ethidium 
bromide. As shown in Figure 4, there are three possible 
results: the individual may be homozygous for the 
wild-type allele and give a DNA product only for the PCR 
reaction with primer J51, the individual may be 
heterozygous with one wild-type and one mutant allele and 
give DNA products with both primers J51 and J61 or the 
individual may be homozygous for the mutation and give a 
DNA product only with the J61 primer. 

EXAMPLE 4 

Alternative Method for Determining the Genotype CYP2A6 

For use of F4 and R4 primers, each reaction 
mixture contained 600 ng human genomic DNA, 0.2 of each 
primer, 2 00 jiM dNTP's, 0.8 mM magnesium acetate and 2 
units of rTth I DNA polymerase. Hot start was as 
indicated by the manufacturer (Perkin Elmer) and the 
amplification reaction of 31 cycles of 93°C, 1 min; 66°C, 
6 min 30 sec. Amplification products were analyzed in 



WO 95/34679 



PCT/US95/07605 



- 31 - 



0.7% agarose gels and the DNA visualized by staining with 
ethidium bromide. For the exon 3 specific amplification, 
the reaction which uses, the primers E3F and E3R consist 
of 5/xl of the 7.8 Kb PCR reaction, 0.5 nK of each primer, 
200 /xM dNTP's, 1.5 mM MgCl 2 and 2.5 units of Taq DNA 

5 polymerase. The amplification reaction consisted of 94 'C 
for 3 minutes followed by 31 cycles of 94 °C, 1 minute; 
60 °C, 1 minute and 72 °C, 1 minute. 

Amplification products were then digested 
without purification with restriction endonucleases which 

10 detect the CYP2A6 wild type (no digestion) , CYP2A6vl 

(Xcral) and CYP2A6v2 (Ddel) . DNA was visualized by use of 
ethidium bromide after electrophoresis in 1% agarose, 3% 

NuSieve agarose. 

It is of note that CYP2C9 genotyping can be 
15 performed using an allele-specif ic assay similar to that 
used above for CYP2A6 . 



20 



25 



30 



35 



WO 95/34679 



PCT/US95/07605 



10 



15 



20 



25 



- 32 - 

CLAIMS 

1. A CYP2A6V2 DNA having a coding sequence 
shown in Figure 11. 

2. The DNA of claim 1 having a genomic 
sequence as shown in Figure 12 . 

3. A CYP2A13 DNA having a coding sequence 
shown in Figure 13. 

4. The DNA of claim 3 having a genomic 
sequence shown in Figure 14 . 

5. A nucleic acid primer sequence comprising 
at least ten (10) contiguous nucleotide bases selected 
from the sequence showing in Figure 12 . 

6. A nucleic acid primer sequence comprising 
at least ten (10) contiguous nucleotide bases selected 
from the sequence shown in Figure 14. 



7. A nucleic acid primer sequence selected 
from the group consisting of: 

A. 5' GGCTTCCTCATCGACGCACT 3'; 

B. 5' GGCTTCCTCATCGACGCACA 3'; 
C 5' AATTCCAGGAGGCAGGGCCT 3'; 

D. 5' TGCAAGTGCCTGTTTCAGCA 3'; 

E. 5' AGCCTTGGTTTTTCTCAACTC 3'; 

F . 5 ' CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCA 
30 GGATTCATGGTGGGGCA 3 ' ; 

G . 5 ' GCCACCACGCCCCTTCCTTTCCGCCATCCTGCCCCCAGTC 
TTAGCTGCGCCCCTCTC 3 ' ; 

H . 5 ' CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCG 
CCAAGCAGCTCCTG 3 ' ; 

35 1.5' CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCC 

CCCGCACTCGGGATGCG 3 ' ; 
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or any nucleic acid sequence of at least 10 contiguous 
nucleotides selected from any one of A-I. 

8. A method of determining the presence or 
absence of an allelic variant in CYP2A6 or CYP2C9 DNA 

5 comprising: 

(a) amplifying an exon containing a variant 
sequence with in said DNA, producing an 
extension product; 

(b) treating extension products with at least 

10 

one restriction endonuclease under 
conditions sufficient to produce digestion 
fragments ; 

(c) analyzing the digestion fragments, for a 
variant specific digestion fragment or lack 

15 thereof. 

9. The method of claim 8 wherein a CYP2C9 
variant DNA is being detected. z 

20 io. The method of claim 9 wherein the 

amplifying step is a polymerase chain reaction using 
primers comprising HF18 and HF2R. 

11. The method of claim 8 wherein step (a) is 
•25 preceded by a gene-specific amplification reaction. 

12. The method of claim 11 wherein the gene- 
specific amplification is a polymerase chain reaction. 

30 13. The method of claim 12 wherein a CYP2A6 

variant is being detected. 

14, The method of claim 13 wherein a gene- 
specific amplification reaction uses primers comprising F4 
35 and R4 and the exon amplification reaction uses primers 

comprising E3F and E3R. 



WO 95/34679 



PCT7US9S/07605 



10 



15 



20 



25 



30 



35 



- 34 - 

15. The method according to claim 10 wherein 
the extension products are treated with the restriction 
endonuclease Avail . 

16. The method according to claim 14 wherein 
the extension products are treated with at least one 
restriction endonuclease comprising Ddel and Xcml. 

17. A method of determining the presence or 
absence of an allelic variant in CYP2A6 or CYP2C9 DNA 
comprising: 

(a) contacting said DNA with a first primer 
encompassing a nucleotide variation 
specific to variant DNA and a second primer 
which is complementary to a region of said 
DNA such that upon hybridization and 
amplification, an extension product will be 
formed; 

(b) analyzing the extension products for 
allelic-variant specific extension 
products. 

18. The method of claim 17 wherein a CYP2A6 
variant DNA is being detected. 

19. The method of claim 18 wherein the 
amplifying step is a polymerase chain reaction wherein the 
first primer comprises J51 and J61 and the second primer 
comprises primer B. 

20. A kit for determining the presence or 
absence of an allelic variant of CYP2A6 or CYP2C9 DNA 
comprising: at least one nucleic acid, primer sequence 
capable of hybridizing to said DNA; the kit further 
containing instructions relating to the determination of 
the presence or absence of an allelic variant of CYP2A6 or 
CYP2C9 DNA. 
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21. The kit according to claim 2 0 further 
comprising amplification components and at least one 
restriction endonuclease . 

22. The kit of claim 20 wherein the CYP2A6 
allelic variant is being detected. 

23. The kit of claim 22 wherein the nucleic 
acid primers comprise F4 , R4 , E3F and E3R. 

24. The kit according to claim 20 wherein the 
CYP2C9 allelic variant is being detected. 

25. The kit according to claim 25 wherein the 
nucleic acid primers comprise HF18 and HF2R. 

26. A process for providing a human with a 
therapeutic CYP2A6v2 or CYP2A13 DNA segment said human 
cells expressing in vivo in said human or therapeutically 
effective amount of said protein. 

27. A pharmaceutical composition comprising an 
antisense nucleic acid derived from CYP2A6V2 DNA. 

28. A pharmaceutical composition comprising and 
antisense nucleic acid derived from CYP2A13. 
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CYP2A6v2 cDNA. 

5 ' TCTAOZACC ATGCTGGGCIXZAGGG^ 

GAGKjTCATGGTCTKiATGITO 

GCIX3CCTOa3GGACCCACCCCATItXX^ 

ACACAGAGCAGATGTACAACTCCXrrcATGAAGAT^ 

OOGTCJITCACXATre^CTTGGGGGaCCG^ 

ATGGCGTCAGGGAGGCItnGGTtX^ 

GAGCAAOXACCITCXiACTCCKjIC^ 

AGGGGGAQGGOGOCAAQCAGCIXXTGOGCITIGaZATCXX^ 

TOGGGGTGGGCAAGCGAGGCATCGAGGAGCXjCATCX^ 

CTCATCGACXjCC^TCCGGAGCACG 

CTX3AGCXX3CAC^GTCKX^TGTCATC^ 

TGACTATAAGGACAAAGAGTTCCTGTCACnX^TrGCGCATGATGCTAGGAAT 

CTTCCAGITCACGTCAACrTaZACGGGGC^ U 1 1U1U lOGG 

TGATGAAACACCTGCCAGGACCACAGCAACACKj^^ 

GCTGGAGGACTTCATAGCCAAGAACIGTGGAGC^ 

TCXXAATTCXXX^CGGGACTTCATTGACra 

AGGAGAAGAACGCCAACACGGAGTTCTACTTGAAGAACXJrG^ 

ACGriGAACCTCITCATrGCAa3CAGOGAG^ 

TGGCTICITGCTCCTQ\TGAAGC^ 

GAGATTGAG\GAGTGATC03CAAGAACCGGCAGaXAAG 

GCCAAGATGCCCTACATGGAGGCAGTGATCCACGAGATC^ 

GACGTGATCCCCATGAGTTTGGCCCGCAGAG 

GGGATnXjrrCCTXXXJTAAGGGCATAGAAGTGTrCXCT 

CTGAGAGACCnXZAGGTTCTIOXXAACXXXXIXjGGAC^ 

CIXjGGTGAGAAGGGKjCAGTTTAAGAAGCGTGATGCT 

TCAGAAAGCGGAACTXnTTGGGAGAAGGCCTGGCCA 

CTIOTCACXACCGTCATGCAGAACrnXGCCT^ 

AGGACATTGACGTGTCXXXXZAAACACXJrGGGC 

(m3GGCGGGGCrAC,GGAAAGGGCAG^ 
G<^nCTAAGACn3GGC»GCAGGATGGCGGAAA 
AGGGAAGAGAAGAAACAGAACiCGGCTCAGTTCACCH^^ 
OAGCTGGGATtoAGAGGAAGGAAACCCTTACATTATGCTA TGAAGAGTAGr 
AATAATAGCAGCTCTTATTTCCTGA 3 ' 
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, ^srrcccc. g^tatggc tctcoxcttc c™gc a*™ 

61 GAGGTTCTAT GGCAGCCATC CTGGCCTCAC TCTGAGGTTC CTGGGCTGCT 
121 AAGAGACAGC TCTGGGCAAA GCTAAATCAA • GTCAGCCCCT aaAC*CCACA 
Si £S?^C?5 GGAGAACCCC OCTGGGCTrC CTJCACACTC 

241 CCCACAGCCC TGGGTCTTCC TAGCCCCGAG ACTTTCAACT C^ATM*. CACAGCCACA 
301 TTCCTGAGAC CCTTAACCCT GCATCCTCCA CAACAGAAGA CCCCTAAAT^ 

Si SSSSS *s=sSS SSI SSSS £££££ 

"1 CCGCaCMCT TTGGGGMCA SSSSc CTCCT*«TC 

Si = s£ = ~- 

sisssssssssss^ESsssss 

- SSSS K 3=3 ™ -SSS SSSE 

841 GATGGTCTTC ATGTCTGTTT GGCAGCAGAG &CAGAGCAGA TGTACAACTC 

901 CACCCCATTC CCCTTCATTG GAAACTACCT GCAGCTGAAC JCAGAG^A TGCCTAGTTG 

1681 TTTACCAGCC CTGGGTCTCT. ° ,^^1~2 -.-..mrr TCC'CTGTCT TAGGATGCCA 
1741 CTCTGGGTTT CTCATCTCTC CGGATCCCTT TCTCAATTCT ^CTGTCT CTCTTCCTTC • 
llil GGGTTATTCC TACTTCCACA TCTTCAGGCT CCATCTCCTG ^CTTCCTT 

SX ££55 SEES =S£ S££= r 

- = === ==£ ™ ™ £552 

ss ssss ssss sees s -™ sss 

2461 GACTCTCTCC CCAACCCCCT ^^ACA ^^GAGG £S££5gG GACCGCTTTG 
2521 TCTTCCTGAG CCGCACAGTC fCCAATGTCA ^AK^CAT SSgAATC TTCCAGTTCA 
2581 ACTATAAGGA CAAAGAGTTC CWIQCTW ^agGCC CTTACCAAAA 

2641 CGTCAACCTC CACGGGCCAC °™HKBT«S SgIccSS SSISc CCCCCGGACA 
2701 CCGGCAAATT GTTCCCCTAC CGGGGGAAG^ eOGCCCgAA IgAGTCGAAC CAGACCCGGG 
2761 GTGTCCCCTC AAAATCAGTC CCCGATTTGG GCA^TOGC £^ tgcTCCCCAA 
2B21 TTGGTTGTCC AATCCCCTGC TCTCCAGGGA GCACAAU^ GGGCACG TGT 

288 1 AACAGAGCCT GCTGGCAGCA SSSSI SSSSS SScSS CXTCTTGAAT 
2941 TCCCATCCCC ^^ACCCC ^TTTCTAA TAAACTTTAG AGATTAGTTC 

3001 attttaacac ccggaaaccc Sagaccag ATGCCTTTAA ctcagttcct 

3061 ctatccggcc cctctgaaat acctaaccac cccgtgacag ctgtccttcc 

3121 tccttgctat gaaacaaatc ccattcccat ^°^tgc cccgtg^ grtgaaACAC 
3iBi crrrcccATCC tctctctgca ^ccccagctc Lctggagga cttcatagcc 

3241 CTGCCACGAC CGCAGCAACA OGCCTTTCJG CCCCACGGGA CTTCATTGAC 

llll ££SS SSSSS SEES «™* ™- 
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3421 CAGGCAGAGG GAAATCAGTC ~GGG GCAGSCAGAT GACACAGGCC CAT£AAArr 
3481 AACCCTCATC ATAATAATCC ACACCAGCCT 
3541 CAGCACTTTG ^GGCCG*G GCAGGTOSAT <££££L AATTAGTTGG GCATGGTGGC 
3601 GGCCAACATC ^^CCCC GTCTCTACTA ATTGCACTC C AGTCTGGGTC 

3661 GCGAAGGGGG GCAGAGGTTG ^TGAGCCA ^ATCAC^C ^^^ fl> AGTCAGCCTG 
3721 ACAGAATGAG ^CCTGTGTC SS^GC TGAGGCTGGA GGATTGCTTG 

3781 ■CATCGTCATG CGCATGTGCA «^AGC^ ££M^A^ CCAAGTCAGT ATAAGAAAAA 
3841 AGCTCAGGAG ££££££ ACGGACAGAT GCXCAGCAAG 

3901 AAAAAAACAA AAAAAAAOT GGCAACGCCA GGAGTCAGGG CAACGGCTGG 

3961 GTAACGAAGG TGAGAAGGAA CCCCTCTTCT CCACCCTGCG GTCTTGCCCC 

4021 TTCCTAGAGC C^TCTGCTA £££^2 TCTGTGTAGA. TCTTGGGGTC 

4081 AAAGAGAGGT CGAGGGTGCT GGGATTGCGC xCCATGGGGT GAACCCCTAG 

4141 CCCTCTTGAC CCCCATTGGT CTGAA^TAA GAGTGGAA3A CCTTCAGGAG 
4201 ATGGTGCCCT ^^AGC ^tgaGCAC GTTGAACCTC 

4261 GAGAAGAACC CCAACACGCA ^TACTTC ^^^^ ATGGCTTCTT ACTGCTCATG 
4321 TTCATTGCAG GCACCGAGAC GCraGCACC SSgACGGA AGTGGAGGGC CCCAGACCCT 
4381 AAGCACCCAG AGGTGGAGGG SSSS SStCCCGG GACCCTGAGA 

4441 CAAAATTCCC CTTCGACTGG TGCAATGTCC CCACCTGTCC ^ TGAGTCTCAT 
4501 CGTGACTTGC TGTCCACAGA C^GGCAACA ££GCTGGT GTCACTTCTG TCCCAAGCCC 
4561 TAGATATTAA AATATTGAAA J^^*: ^££ TAA fflTCCTCCCT GTGCCTCCCC 
4621 ACTGAGTGCC CACTGCCCGT ^CGGGT ^CCCTAA TGGGTGATGT 
4681 TGTGATTCTG GCACAACCTG GTTAACAGGA ^TACTCCA TC CACCCCAT 
4741 CTGTTCTGTT ATGAATGCTC ^CTTCCGTC J^™^* AGATACCTAA ACACATTCCC 
4801 TTTGCCTATC ^SaSS IS^SgS ^SgCAA GAACCGGCAG 

4861 CCTCCTCCCC CAGCCAAGGT CCATGAGOAU m CAGTGATCCA CGAGATCCAA 

4921 CCCAAGTTTG AGGACCGGGC CAAGATGCCC ™£ATGGAGG 

4981 AGATTTGGAG ACGTGATCCC CATGAGTTTG S^SaCTAC GGGGACTCCA 

5041 CGGGATTTCT TCCTCCCTAA SS£££ SScTAGAC CCTGTCCCAC 

5101 GCCCCTCTCT GTOTCCCCAG ^^ACCC a SSSaT CCGTTCCACC TTTCCACTTA 
5161 TCCCTCAATC AGTCAAAAAA GACTTCCCCA ACCACCACAT CC^ jg^MiT 
5221 GACACTCCTG AGTCCTGCAT CTCTCCAGAC ^TTTGTGTC TGTCATAGGG 
. 5281 CCCAAACTTC CTATCTTAAG AAACAGAAGC ATCGACCCCX TGTCTCCCAA 

5341 ACAGAAATCT CAGGTCCCCC AAACTCCTGC S™^^, tcCTCCCTCA GAGGTCCCCA ■ 
5401 ACTTCCTGTT TCAGAGATGT aAKCTTCT* SScStS ScCCCCTCC AGCCCCTGTG 
5461 ATTCCCATGC CTGCCACTTC CCCTCACCGG «C*CCCX*S CTCCCAGGCC 
5521 TACTCTCAAC AATCCCCCAA CCCGCCTCAT TCAGGTTCTT CTCCAACCCC 

55B 1 ATAGAAGTGT SSagSIc AG^SSU GCGTGA^CT 

5641 CGGGACTTCA ATCCCCAGCA ^^^O^ ■■■■i-lGOTG CCAGGCTTAC TACTCACACC 
5701 TTTGTGCCCT TCTCCATCAG ^SSS aS^GTAT TTCCCCAGCT 

5761 AGCAGGGGCC TCCCTTACCC ^^CC^ A ScACCAGG TGATACTCCC TTAACTACCA 
5821 TGGCAAGTTC CTGTTA^AA ISSaSS TCATACCCCT TTCAGAGGCG 

5B81 AGCACCCAGT ACCTCTCCCC ^.ATrrc CCTAGGGTCA CACAGGAGAT 

5941 GGGGAAAACC SSSSSS SSgCAGGT CATATTTGGG AGTTCTTATC 

6001 TCTTCAGCAT CCCTAAAAAG SgGACACCT GGCATCGATC AACCCCATCT 

6061 TGGGGGAAOG GGGATCTTAA ACCTCCCATT ^^Z^AA GGAGGGTCAA GAGGCTCCCT 
6121 TTTGGTCATC mTGGGTCA ^AAGGAAA CTGAGGTCAA GGA^ ^ 
6181 CTTAAAGTCT CTCAGGGCCA TATAJTCCAC CTCCACCCCT CCCGCCTCTC 

6241 GTCGGTACTG GGGCGAGGCT AGAATGGAGC TCTTTCTCTT 

6301 CTCCTCAGGA AAGCGGAACT ^S^C C CAGTCACCTA AGGACATTGA 

6361 CTTCACCACC GTCATGCAGA ^TTCCGCCT aACTACACCA TGAGCTTCCT 

6421 CGTGTCCCCC AAACACGTGG GCTTTGCCAC GCGGGGCCAG GGAAAGGGCA 

6481 gccccgctga gcgagggctg tcc^^ SSaaIact gggggcagga tggcxsgaaag 
6541 gggccaacac cgggcttggg agaggcgcgc agctaa cggctcrgtt cacCTTGATA 
6601 gaaggggcgt ggiggctaga gggaa^agaa caaa^ mttatgctat gaagagtagt 

6661 AGGTGCTTCC S£aS ^cSS TCACCTTTGT TCAAAAACCA 

S 6 7 7S1 SS cSS^C CCCTTCGAAG GGGCGTTCAT GCCCA^A 
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CYP2A13 cDNA 

5 ' £EGGG£AOC ATCCTCGCCT 

GACTCTXl^TGGraTCATG?I^ 

GCraXTOCGGGACOCACXXX^^ 

CACAGAGCAGATGTACAACTCGCTCATUAAGATCAGTC 

GTCHTCACC^TICACrn3GGG^ 

GODGrTCAAGGAGGCICroGTGGACCAGGC^ 

GCAGGCCACCITGGACTGGCICTTCAAAGGCrATGGCGT^ 
GGGAGCT30CKDCAAGCAGCIXXXJGCG<^^ 

G0GTTGGGCAAG0G0GGCA3O3AGGAA0GCATC^ 

ATOGAOSCCCItXriGGGCACGCACGGC^^ 

AGO^jCACAGflUnXAATCnCA^ 

CTATGAGGACAAAGAGTiranGTCACnXTITGC^ 

CAGTrCACGGGAACCrCCACGGGGCAGCnn'ATG^ 

GAAACACXHTGCCAGGACCACAGCAACAGGCJCITrAAGGA 

GGAGGACTIXZATCGCXIAAGAAGGTGGAGCACAACXAGO 

AATTCOCCAGGGGACITCATCXjACrrCXnTICTC^ 

GA AGAACOCX!AAC ACAGAGTTCTACTTGAAGAAOCTC 

GAA(XTCTIXTITrcaXKjCACK^ 

OCIXKrraCIXL\TGAAGCAOCC^ 

GACAGAGTGATCXXX^AAGAACCGGCAGGCCAAGTTTGAG^ 

ATGCCCTAC ACAGA GGCAGTCATCCACGAGATCX^ 

CTCCaZATGGGTITTK3COCA^ 

TCTTCCiaXTAAGGGCACITGAAGltnT^^ 

GACCXXAGGncnCipCA^^ 

GAGAAGGGGCAGTTTAAGAAGAGTGATGCTTTTGTGCCCI^^ 

AAGCGGTACIXJTTITGGAGAAGGCCTGGCCAGAATGGAG 

TCACCACCATCATGCAGAACriTC^ 

ATOGACGTGTOCXIXXZAAACACGTGGGCrTTGOCACGATCCXZA 

CATGAGCXTOCTGCXm3CTG AGawy^^ 

CK3COXAGGGAAAGGGOCX}GGGCAr/im^ 

AAGAATGGGGGC2AGTG GGGGAAGGAAGGGGAGAGGTGGTTAOAGGGAACA 
GAAGAAACAGAAGGG GCTC!AGTTCAGCTTGATnATGTr(TrT^ 

ATGAGAGGAAGGGA AACCTTACAGTATGCTACAAAGAGTAGTAATAATA 
GCAGCTCTTATCTOTTGA 3' 
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3421 CC^CACTTC ACXATCTTCA CCAGCCCCAC TTJATACCTG AGCACCTCAA CAAAA^CC 
3.i81 CAATCCAGAC CCAGTAAGTA TCTGGACAGC ^TCTCCAAC ^ AAACACCTGG 
3541 AATACCTAGA CAGGTGCCAC TCAC^J A S£SS£ SSaGATCT GTGCTCCAAT 
360 1 ACACCTCTCT TCCAACTCAA ™"TGA ATATCTGAAC ACCTAAAT TA 
3661 CCAGCCTCAT TTGCATACCT ^ACCTCGA gatATAAGTT TCCATCCAAC 

3721 CTAGACCCTC CCCCTGGCAC CTAA^CAK ^AACTTA CCTTGCTATC 
3781 CCCACTGAAA TACCTAAACA CCTTOACAfiA £*L^^ CTGTCCTTCC CTTCCCATCC 
3841 AAACAAATCC CCATTCCCAT ^TCCTG. ^SSaCAC CTGCCAGGAC 

3961 CACAGCAACA «C«TTAAG S^^^S CTTCATCGAC TCCTTTCTCA 

4021 AGCACAACCA GCGCACGCTG ^CCAATT GGTGCAAACC CAGGGAGAGG 

4081 TCCGCATGCA GGAGGTACAT CCCAGCAGCC AG £*^* SScAAATT AGCCCTCGTC 
4141 GAAATCAGGA TGGGAGTGGG CTGGGCAGAC S^SJicC CAGCACTTTG 

4201 ATAATAATCC TTACAATTGG CCAGGCG^ AGACCACCCT GGCCAACATG 

4261 GGAGSCCGAG GCAGGTGGAT CACCTGAGGT CAGGAGTTCC ATGCGCCTGT 
4321 GTGAAACCCC GTCTCTACTA JJAATACAAA S^?SgG AGGCAGAGGT 

4381 AATCCCAGCT *CTCAGGAGG CTGAGACAGA AGAATTTGTT AGACCCTGTA 
4441 TGCAGTOAGC CGGGATCATG ^ACTGCACT ^.^catCA CCCACTGCTG 

4501 AAAAAAAAAA AAAAAAAAAA AAAAAATTCC a^GGATTA GATTGGAAAG 

4561 TCCCATCTAC TGAGCCCTCA CCCACAAGGA CGGCTTATGG TAGTCCGCCA 
4621 AACTTCTCAA GAACTACCGG ^^AGCA^ GCCCCAGTTG TACAAATGAC 

4681 TGGAACACTT TTAACAGTTC TTGAGGGj^G J^ACTCATC AGGAAGACCA 
4741 GAAACTGAGG CCCAGAGAGT TTAAGTGTCT ^Scm TTAGCCACCA 

4801 TGGTCCCCCT AGCTCAAACC CTGGTCTCTC TGAGCCTATA GCTGG^ GTGACCTGGC ; 
4861 TGCTCTCTAA CCGTTCATGT CCTCGTTAGC AGACACACCT CTGTGGACAG 
Sal TTTACATTGC AGGGTCCCCG CCTACCTCTG f^SSS AAGTTTCAAG 

4981 GGAAGCCAAA GCTCAGGGAG AAAGGATCAA J^****™ GTGCTTGTAT CTCAGAAAAA 
5041 ATTTTTAGGG AAGAAATAGG CTCTGTACTA CTTTCTCTTC GTCTCCCCTC 

5101 CTC *ZZZZ SJSSS £££££ £££££ SSSSS aacatggcaa 

5161 ATCCTTCTCT TTCCAAATAT TCCTATCATT ATGCCTGCGG TCCCAGCTAC 

5221 AACCCCGTCT GTACAAAAAA ATGGCTAGGC ^aScC„ TCAATGAGCC 

5281 TAAGGAGGTT GAGGTGGGAG ^ATCTTGA ^CA^GTG SSgTCTCT CCCAAAAAAA ; 
5341 CATATCACAG CCCTGCCCTC ^TGGGT ^SS^ SSSSgTG CATGTGCAGT 
5401 AAAAGAATTA ATTTTTTAAC AGTTAACAAG ^=~L™ CCCAGGAGTT GGAGTCCAGC 
5 5461 TCCAGCTACT JTOGAGGCTG AGACCGGAGG ^CCTTGAA GACAGCTAAG ; 

5521 CTGTGCAACT TAGCAAGACC AAGTCTGCAT aJ^gaAGG TGAGAAGGAA GAGCATTTTC 
5581 TTGACAATTA AAGGATAGAT ^^CAGTGAG SS^CA AATCTAGGGT 

5641 GGCAAAGCCA GCAGCCAGGG ^AGGGCTGG ^TGGAGC ^SgAAA agccCTAGAA 
5701 CCCTCTTTCC ACCTTTGGTC TGGACCAAAG AGAGGTAGCT CC^^ CAGGAGGAGA 
5761 GGGCCCCAAG AGCATGGAGA 3TOAGCTTOG TCTAAAC^C CCTCTC^TG 
5821 AGAACCCCAA CACAGAGTTC TACTTGAAGA ACCTGGTGAT CTCATGAAGC 
5881 TTGCGGGCAC TGAGACCGTG AGCACCACCC TGCGCTACGG 

5941 ACCCAGAGCT GGAGGGTAAG ACTGGAAAGG TCCCAGGACC CTGAGACGTG 

6001 ACTCCCCTGA GCCTGGTGCA GTGTACCCAC ^TA^^ ScAGCTCAG TCTCATTAGC 
6061 CCTTGCTGTC CAGA3ACAGG ACAATATTCA CTTCC^CC AAGCCCACTG 

6121 TATTAAAATA TTCAAAATGT ^TCCACT^AT ^^^^ CTCCCTTGTC CTACCCTGTG 
6181 AGTGTCCSCT GCCTGCTCCT CIGGATCATC CfCTAACTTC .^^^ GGGTGATGTC 
6241 ATTCTGACAC AACCTGG^T AACAGGGATC CTGCTG^AAA CA^^ XCTCAACCGC 
6301 TTGTrCTTGT "ATGAATGG GCTTACCCTT CGTCTCAGA^ 

6361 CGTcrrrrAG ^aggggggg cg^ta^tgc ^^^^ TCGGCAAGAA 

6421 ATATTCCCCT ^"CCGCCAG ^AAGGTCCA * ACAGAGGCAG TGATCCACGA 

6481 CCGGCAGCCC AAGTTTGAGG ^CCGGGCCAA CACAGGGTCA ACAAGGACAC 

6541 GATCCAAAGA "^GACA TGCTJCCCAT CTCCACCACC ACCACTCAGA 

SS S^SS SSSS |= 

r, 2 .i ssss ss SSS^ ISSSS »c™c ™« T 
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3421 CCTCCACTTC AGCATCTTCA CCAGCCCCAC TTTATACCTG AGCACCTGAA CAAAAGCCCC 
3481 CAATCCAGAC CCAGTAAGTA TCTGGACAGC TGTCTCCAAC CAAGTCCACT TGAATGCCTA 
3541 AATACCTAGA CAGGTGCCAC TCACCTCATA CCAGCCCCAC CTGAAGAGCT AAACACCTGG 
3601 ACAGGTGTCT TCCAACTCAA CTTCACTTGA ATATCTGAAC ACCTAGATGT GTGCTCCAAT 
3661 CCAGCCTCAT TTGCATACCT GAAACCTGGA TATATGCCTC AGTTCTTCTC ACCTLAAATTA 
3721 CTAGACCGTG CCCCTGGCAC CTAATCCACG TGAAAACTTA GATATAAGTT TCCATCCAAC 
3781 CCCACTCAAA TACCTAAACA CCTGGACAGA TGCCTTTAAC TCCGTTCCTT CCTTGCTATG 
3841 AAACAAATCC CCATTCCCAT CAGCTCCTGC CCCGTGACAG CTGTCCTTCC CTTCCCATCC 
3901 TCTCTCTGCA ACCCCAGCTC TATGAGATGT TCTCTTCGGT GATGAAACAC CTGCCAGGAC 
3961 CACAGCAACA GGCCTTTAAG GAGCTGCAAG GGCTGGAGGA CTTCATCGCC AAGAAGGTGG 
4021 AGCACAACCA GCGCACGCTG GATCCCAATT CCCCACGGGA CTTCATCGAC TCCTTTCTCA 
4081 TCCGCATGCA GGAGGTACAT CCCAGCAGCC AGTGCAGGCA GGTGCAAACC CAGGGAGAGG 
4141 GAAATCAGGA TGGGAGTGGG CTGGGCAGAC GACACAGGCC CATTCAAATT AGCCCTCGTC 
4201 ATAATAATCC TTACAATTGG CCAGGCGCGG TGGCTCATGA CCTGXAATCC CAGCACTTTG 
4261 GGAGGCCGAG GCAGGTGGAT CACCTGAGGT CAGGAGTTCG AGACCAGCCT GGCCAACATG 
4321 GTGAAACCCC GTCTCTACTA AAAATACAAA AATGAGCTAG GTATGGTGGC ATGCGCCTGT 
4381 AATCCCAGCT ACTCAGGAGG CTGAGACAGA AGAATTTGTT TGAATCCGGG AGGCAGAGGT ' 
4441 TGCAGTGAGC CGGGATCATG CCACTGCACT CCGGCCTGAG TGACAGAGCA AGACCCTGTA 
4S01 AAAAAAAAAA AAAAAAAAAA AAAAAATTCC GGAAAACCCC AATTACATCA CCCACTGCTG 
45 61 TCCCATCTAC TGAGCCCTCA CCCACAAGGA CGGGTTATGG AGGTGGATTA GATTCGAAAG 
4621 AACTTCTCAA GAACTACCGG GTGCCAGGAA CTGGGTTAAG TGTTTTATGA TAGTCCGCCA 
4681 TGGAACACTT TTAACAGTTC TTGAGGGAGG TTCACTCATG GCCCCAGTTG TACAAATGAG 
4741 GAAACTGAGG CCCAGAGAGT TTAAGTGTCT TAACTGAGGT CACAACAGTG AGGAAGACCA 
4801 TGGTCCCCCT AGCTCAAACC CTGGTCTCTC TGAGCCTATA GCTGGTGCTT TTAGCCACCA 
4861 TGCTCTCTAA CCGTTCATGT CCTGGTTAGC AGACACACCT CTGTGGACAG GTGACCTGGC 
4921 TTTACATTGC AGGGTCCCCG CCTACCTCTG GATCTCAGCC TCCCATGTGG GAAGGCTTTA 
4981 GGAAGCCAAA GCTCAGGGAG AAAGGATCAA GGGAGGGATT CCTCCACAGT AAGTTTCAAG 
S041 ATTTTTAGGG AAGAAATAGG ATGCTGTTGC TTAAAATTCT GTGCTTGTAT CTCAGAAAAA 
5101 CTCTTTTTTT CTGACTCTTC ATCTTGCCAT CTCTGTACTA CTTTCTCTTC GTCTCCCCTC 
5161 ATCCTTCTCT TTCCAAATAT TCCTATCATT AAAAAAGTAA CAGACTGGGA AACATGGCAA 
5221 AACCCCGTCT GTACAAAAAA ATGGCTAGGC ATGGTGGTGC ATGCCTGCGG TCCCAGCTAC 
5281 TAAGGAGGTT GAGGTGGGAG GATATCTTGA GCCCAGGGTG GGCAGAGCTT TCAATGACCC 
5341 GATATCACAG CCCTGCCCTC CACCCTGGGT GACAGAATAA GACCGTGTCT CCCAAAAAAA 
5401 AAAAGAATTA ATTTTTTAAC AGTTAACAAG TGAGCCTGCA TAGTCATGTG CATGTGCAGT 
5461 TCCAGCTACT CTGGAGGCTG AGACCGGAGG ATTCCTTGAA CCCACGAGTT GGAGTCCAGC 
5521 CTGTGCAACT TAGCAAGACC AAGTCTGCAT AAAAAAAAAA AAAACCAACT GACAGCTAAG 
5581 TTGACAATTA AAGGATAGAT GATCAGTGAG GTAAAGAAGG TGAGAAGGAA GAGCATTTTG 
5641 GGCAAAGCCA GCAGCCAGGG CAAGGGCTGG AACCTGGAGC GAGTTTGGCA AATCTAGGGT 
5701 CCCTCTTTCC ACCTTTGGTC TGGACCAAAG AGAGGTAGCT CCAAAGGAAA AGCCCTAGAA 
5761 GGGCCCCAAG AGCATGGAGA GTGAGCTTGG TCTAAACCGC CCTCTCCCTG CAGGAGGAGA 
5B21 AGAACCCCAA CACAGAGTTC TACTTGAAGA ACCTGGTGAT GACCACCCTG AACCTCTTCT 
5881 TTGCGGGCAC TGAGACCGTG AGCACCACCC TGCGCTACGG TTTCCTGCTG CTCATGAAGC 
5941 ACCCAGAGGT GGAGGGTAAG ACTGGAAAGG GAGGAAAGTG AAGGGCCCCA GACCCTCAAA 
6001 ACTCCCCTGA GCCTGGTGCA GTGTACCCAC CTATCCCAGA TCCCAGGACC CTGAGACGTG 
6061 CCTTGCTGTC CAGAGACAGG ACAATATTCA GCTGATAGGC ATCAGCTGAG TCTCATTAGC 
6121 TATTAAAATA TTGAAAATGT CTGCACTGAT TGGTCAGTCA CTCCTGTCCC AAGCCCACTG 
6181 AGTGTCCGCT GCCTGCTCCT CTGGATCATC CCCTAAGTTC CTCCCTTGTC CTACCCTGTG 
6241 ATTCTGACAC AACCTGGTTT AACAGGGATC CTGCTGCAAA CAATGCGAAT GGGTGATGTC 
6301 TTGTTCTTGT TTATGAATGG GCTTACCCTT CGTGTCAGAG GTGGAAGCTA TCTCAACCGC 
6361 CGTGTTTTAG CTAGGGGGGG CGATAGATGC CCTGCTCTAA GACCCCTAGA GAGGGTAAAG 
6421 ATATTCCCCT CCTCCGCCAG CCAAGGTCCA TGAGGAGATT GACAGAGTGA TCGGCAAGAA 
6481 CCGGCAGCCC AAGTTTGAGG ACCGGGCCAA GATGC CCTAC ACAGAGGCAG TGATCCACGA 
6541 GATCCAAAGA TTTGGAGACA TGCTCCCCAT GGGTTTGGCC CACAGGGTCA ACAAGGACAC 
6601 CAAGTTTCGG GA T TTCTTCC TCCCTAAGGT GCTGTCTCCC CTCCACCACC ACCACTCAGA 
6661 CTACGGGGAC TTCCAGCCTC TCTCTGTGTC CCCAGAATCC TGCCCCCATT AGTGTTCTAG 
6721 ACTCTGTCCC ACTCCCTCAA TCAGTCAAAA AAGACTTCCC CAACCACCAC ATCTGTTCCA 
6781 CCTTTCCACT TAGACAGTCC TGAGTCCTGC ATCTCGCCAG ACTCTTTGTG TCAGGAGAAT 
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6841 ACACCC=A« TTCCCAATCT TCCT^CTTA SSSS IaS^gIS 

6901 TGTGG CTT AG GGACACAAAT CTCAGGTCCC ^AAACACCC SScCCA AAGCTCCTCC 
6961 CCATCTCTCC CAAACTTCCT ^CTCAGRGA »™»AACTT CACCTGGGGS ACCCTAGAGC 
7021 CTCAGAGGTC CCCAACTCCT CCATGTCOTG CCACTCCCCG ^CT^ 
7081 CCCCTGGAGC CCCTGTGTAC TTTCACCAAT ^CCAACC ^ gagagacCCC 
7141 CCTCCTCCCT CCCAGGGCAC TGAAGTCTTC ^JTGCTGG GAAGGGGCAG 
7201 AGGTTCTTCT CCAACCCCCA GGACTGCAGT «CCAGCACT ^^^^^ TTTGCTGCCA 
7261 TTTAAGAAGA SSSS CC^SSg CGGTGTAGCC 

7321 GGCCACGGCT CACACCAGCA ^GGGCCTCTC ATTOAGCCGC CACCAGCTGA 

7381 TCGTATTTCT CCAGCTTGCA £££££ JS^SaGG TAAAAGGGAA GGAAACATC? 
7441 TACTCCCTTA ACTGCCAAGC ACCCAATAUV. CTTCAGCTCC CTGAAAAGGA 

7501 TCCCCCATAC ATTTATTTGT CTAGGGTCAC *CAGCAGATT gogcATCTAA 

~ =5 SEE SI 3= = = 

~ 55 SE S5 

7921 GGCTTCACCT TCACCCCTCC TGCCTCTCCT °™G«W£ CATCCACAAC TTTCGCTTCA 
7981 GCCTGGCCAG JA^GAOCTC TTTCTCTTCT £££££ ?££^C TTTGCCACGA 
8041 AGTCCCCTCA GTCGCCTAAG ^ATCGACG GAGGGCTGTG CTGGTCCACC 

8101 SSSaS aaSSSS SSSS GGGC^TGG GAGGGGCGGG 

8161 GCTGGTCGCC GGGGCCAGGG *~£«wcw. «GGAGAGGTG GTTAGAGGGA ACAGAAGAAA 
8221 GCTAAGAATG GGGGCAGTGG GGGAAGGAAG G^JfJ^ TC^GATCAGA GGAAGGGAAA 
8281 CAGAAGGGGC TCAGTTCACC TTGATGATGT CCTTCAGAGC aacAAGTCCG . . 

8341 CCTTACAGTA TGCTACAAAG *=™OTJATA JSSJSS SSSSg CCACACACCT : 

8401 tccctgtcag ctttcttcaa aaagcqttgc JSgaaaggt GCCGCTCAGA 

8461 CTACCAATGG GGGAAAAGTC ^^^T^aC GCAAGTCTCC AGAGGATCTT GGAACACAGA ' 
8521 AAGTTGTCTC TATCTGAAAA ^AAAAC SCJAGTGTCC J^GGA^ . 

8581 TCTGGGCCCA TAGCCCTCTA GATCGATCCT CAGAGACCTG 

SS ™=S SSES SSS^ SSSS o^^cc 

8761 GTGTCACCTG AGCTCGCTA 
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